A Shape-invariant Phase Vocoder for Speech Transformation

نویسنده

  • A. Röbel
چکیده

This paper proposes a new method for shape invariant realtime modification of speech signals. The method can be understood as a frequency domain SOLA algorithm that is using the phase vocoder algorithm for phase synchronization. Compared to time domain SOLA the new implementation provides improved time synchronization during overlap add and improved quality of the noise components of the transformed speech signals. The algorithm has been compared in two perceptual tests with recent implementations of PSOLA and HNM algorithms demonstrating a very satisfying performance. Due to the fact that the quality of transformed signals stays constant over a wide range of transformation parameters the algorithm is well suited for real-time gender and age transformations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Shape-invariant speech transformation with the phase vocoder

This paper proposes a new phase vocoder based method for shape invariant real-time modification of speech signals. The performance of the method with respect voiced and unvoiced signal components as well as some control strategies for the voiced/unvoiced balance of the transformed speech signals will be discussed. The algorithm has been compared in perceptual tests with an implementation of the...

متن کامل

Real Time Pitch Shifting with Formant Structure Preservation Using the Phase Vocoder

Pitch shifting in speech is presented based on the use of the phase vocoder in combination with spectral whitening and envelope reconstruction, applied respectively before and after the transformation. A band preservation technique is introduced to contain quality degradation when downscaling the pitch. The transposition ratio is fixed in advance by selecting analysis and synthesis window sizes...

متن کامل

Speech to chant transformation with the phase vocoder

The technique used for this composition is a semi automatic system for speech to chant conversion. The transformation is performed using an implementation of shapeinvariant signal modifications in the phase vocoder and a recent technique for envelope estimation that is denoted as True Envelope estimation. We first describe the compositional idea and give an overview of the preprocessing steps t...

متن کامل

Suppression of phasiness for time-scale modifications of speech signals based on a shape invariance property

Time-scale modifications of speech signals, based on frequency-domain techniques, are hampered by two important artifacts which are “phasiness” and “transient smearing”. They correspond to the destruction of the shape of the original signal, i.e. the de-synchronization between the phases of frequency components. This paper describes an algorithm that preserves the shape invariance of speech sig...

متن کامل

Using FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder

Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010